AITopics | Missoula

Collaborating Authors

Missoula

What would happen if Yellowstone's 'supervolcano' erupted today?

What would happen if Yellowstone's'supervolcano' erupted today? Say goodbye to Montana, Wyoming, and Idaho. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. This photo of a volcano in Iceland doesn't even begin to encapsulate the devastation that would happen if the Yellowstone volcano erupted. Breakthroughs, discoveries, and DIY tips sent six days a week.

artificial intelligence, eruption, yellowstone, (12 more...)

Popular Science

Country:

North America > United States > Wyoming (0.26)
North America > United States > Idaho (0.25)
Europe > Iceland (0.25)
(4 more...)

Genre: Research Report > New Finding (0.35)

Industry:

Energy (0.71)
Media (0.70)

Technology: Information Technology > Artificial Intelligence (0.35)

Add feedback

Time-Warping Recurrent Neural Networks for Transfer Learning

Hirschi, Jonathon

arXiv.org Machine LearningApr-6-2026

Dynamical systems describe how a physical system evolves over time. Physical processes can evolve faster or slower in different environmental conditions. We use time-warping as rescaling the time in a model of a physical system. This thesis proposes a new method of transfer learning for Recurrent Neural Networks (RNNs) based on time-warping. We prove that for a class of linear, first-order differential equations known as time lag models, an LSTM can approximate these systems with any desired accuracy, and the model can be time-warped while maintaining the approximation accuracy. The Time-Warping method of transfer learning is then evaluated in an applied problem on predicting fuel moisture content (FMC), an important concept in wildfire modeling. An RNN with LSTM recurrent layers is pretrained on fuels with a characteristic time scale of 10 hours, where there are large quantities of data available for training. The RNN is then modified with transfer learning to generate predictions for fuels with characteristic time scales of 1 hour, 100 hours, and 1000 hours. The Time-Warping method is evaluated against several known methods of transfer learning. The Time-Warping method produces predictions with an accuracy level comparable to the established methods, despite modifying only a small fraction of the parameters that the other methods modify.

artificial intelligence, machine learning, prediction, (20 more...)

arXiv.org Machine Learning

2604.02474

Country:

North America > United States > Colorado > Denver County > Denver (0.14)
North America > United States > Oklahoma (0.06)
North America > United States > Rocky Mountains (0.04)
(15 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Does Local News Stay Local?: Online Content Shifts in Sinclair-Acquired Stations

Wanner, Miriam, Hager, Sophia, Field, Anjalie

arXiv.org Artificial IntelligenceOct-9-2025

Local news stations are often considered to be reliable sources of non-politicized information, particularly local concerns that residents care about. Because these stations are trusted news sources, viewers are particularly susceptible to the information they report. The Sinclair Broadcast group is a broadcasting company that has acquired many local news stations in the last decade. We investigate the effects of local news stations being acquired by Sinclair: how does coverage change? We use computational methods to investigate changes in internet content put out by local news stations before and after being acquired by Sinclair and in comparison to national news outlets. We find that there is clear evidence that local news stations report more frequently on national news at the expense of local topics, and that their coverage of polarizing national topics increases.

artificial intelligence, natural language, social media, (17 more...)

arXiv.org Artificial Intelligence

2510.0706

Country:

Asia > Middle East (1.00)
Europe (0.94)
North America > United States > Montana > Missoula County > Missoula (0.28)
North America > United States > Rhode Island > Providence County > Providence (0.28)

Genre: Research Report (1.00)

Industry:

Media > News (1.00)
Leisure & Entertainment > Sports > Football (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Health & Medicine > Therapeutic Area > Oncology (0.92)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications > Social Media (0.94)

Add feedback

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Li, Xiaoxi, Jin, Jiajie, Zhou, Yujia, Wu, Yongkang, Li, Zhonghua, Ye, Qi, Dou, Zhicheng

arXiv.org Artificial IntelligenceDec-16-2024

Large language models (LLMs) exhibit remarkable generative capabilities but often suffer from hallucinations. Retrieval-augmented generation (RAG) offers an effective solution by incorporating external knowledge, but existing methods still face several limitations: additional deployment costs of separate retrievers, redundant input tokens from retrieved text chunks, and the lack of joint optimization of retrieval and generation. To address these issues, we propose \textbf{RetroLLM}, a unified framework that integrates retrieval and generation into a single, cohesive process, enabling LLMs to directly generate fine-grained evidence from the corpus with constrained decoding. Moreover, to mitigate false pruning in the process of constrained evidence generation, we introduce (1) hierarchical FM-Index constraints, which generate corpus-constrained clues to identify a subset of relevant documents before evidence generation, reducing irrelevant decoding space; and (2) a forward-looking constrained decoding strategy, which considers the relevance of future sequences to improve evidence accuracy. Extensive experiments on five open-domain QA datasets demonstrate RetroLLM's superior performance across both in-domain and out-of-domain tasks. The code is available at \url{https://github.com/sunnynexus/RetroLLM}.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2412.11919

Country:

North America > United States > District of Columbia > Washington (0.14)
Europe > Austria > Vienna (0.14)
North America > United States > California > San Francisco County > San Francisco (0.04)
(24 more...)

Genre:

Personal > Honors (1.00)
Research Report > New Finding (0.67)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Media > Music (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

FLAME 3 Dataset: Unleashing the Power of Radiometric Thermal UAV Imagery for Wildfire Management

Hopkins, Bryce, ONeill, Leo, Marinaccio, Michael, Rowell, Eric, Parsons, Russell, Flanary, Sarah, Nazim, Irtija, Seielstad, Carl, Afghah, Fatemeh

arXiv.org Artificial IntelligenceDec-3-2024

The increasing accessibility of radiometric thermal imaging sensors for unmanned aerial vehicles (UAVs) offers significant potential for advancing AI-driven aerial wildfire management. Radiometric imaging provides per-pixel temperature estimates, a valuable improvement over non-radiometric data that requires irradiance measurements to be converted into visible images using RGB color palettes. Despite its benefits, this technology has been underutilized largely due to a lack of available data for researchers. This study addresses this gap by introducing methods for collecting and processing synchronized visual spectrum and radiometric thermal imagery using UAVs at prescribed fires. The included imagery processing pipeline drastically simplifies and partially automates each step from data collection to neural network input. Further, we present the FLAME 3 dataset, the first comprehensive collection of side-by-side visual spectrum and radiometric thermal imagery of wildland fires. Building on our previous FLAME 1 and FLAME 2 datasets, FLAME 3 includes radiometric thermal Tag Image File Format (TIFFs) and nadir thermal plots, providing a new data type and collection method. This dataset aims to spur a new generation of machine learning models utilizing radiometric thermal imagery, potentially trivializing tasks such as aerial wildfire detection, segmentation, and assessment. A single-burn subset of FLAME 3 for computer vision applications is available on Kaggle with the full 6 burn set available to readers upon request.

artificial intelligence, flame 3, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2412.02831

Country:

North America > United States > Montana > Missoula County > Missoula (0.14)
North America > United States > Arizona (0.04)
North America > United States > Rocky Mountains (0.04)
(6 more...)

Genre: Research Report (0.82)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Transportation > Air (0.67)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
(2 more...)

Add feedback

Exploring the Impact of a Transformer's Latent Space Geometry on Downstream Task Performance

Marbut, Anna C., Chandler, John W., Wheeler, Travis J.

arXiv.org Artificial IntelligenceJun-17-2024

It is generally thought that transformer-based large language models benefit from pre-training by learning generic linguistic knowledge that can be focused on a specific task during fine-tuning. However, we propose that much of the benefit from pre-training may be captured by geometric characteristics of the latent space representations, divorced from any specific linguistic knowledge. In this work we explore the relationship between GLUE benchmarking task performance and a variety of measures applied to the latent space resulting from BERT-type contextual language models. We find that there is a strong linear relationship between a measure of quantized cell density and average GLUE performance and that these measures may be predictive of otherwise surprising GLUE performance for several non-standard BERT-type models from the literature. These results may be suggestive of a strategy for decreasing pre-training requirements, wherein model initialization can be informed by the geometric characteristics of the model's latent space.

latent space, preprint arxiv, representation, (16 more...)

arXiv.org Artificial Intelligence

2406.12159

Country:

North America > United States > Montana > Missoula County > Missoula (0.14)
North America > United States > Arizona > Pima County > Tucson (0.14)
North America > United States > Indiana (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Can Large Language Models Learn the Physics of Metamaterials? An Empirical Study with ChatGPT

Lu, Darui, Deng, Yang, Malof, Jordan M., Padilla, Willie J.

arXiv.org Artificial IntelligenceApr-23-2024

Large language models (LLMs) such as ChatGPT, Gemini, LlaMa, and Claude are trained on massive quantities of text parsed from the internet and have shown a remarkable ability to respond to complex prompts in a manner often indistinguishable from humans. We present a LLM fine-tuned on up to 40,000 data that can predict electromagnetic spectra over a range of frequencies given a text prompt that only specifies the metasurface geometry. Results are compared to conventional machine learning approaches including feed-forward neural networks, random forest, linear regression, and K-nearest neighbor (KNN). Remarkably, the fine-tuned LLM (FT-LLM) achieves a lower error across all dataset sizes explored compared to all machine learning approaches including a deep neural network. We also demonstrate the LLM's ability to solve inverse problems by providing the geometry necessary to achieve a desired spectrum. LLMs possess some advantages over humans that may give them benefits for research, including the ability to process enormous amounts of data, find hidden patterns in data, and operate in higher-dimensional spaces. We propose that fine-tuning LLMs on large datasets specific to a field allows them to grasp the nuances of that domain, making them valuable tools for research and analysis.

absorptivity, metasurface, resonator, (17 more...)

arXiv.org Artificial Intelligence

2404.15458

Country:

North America > United States > Montana > Missoula County > Missoula (0.14)
Asia > Middle East > Jordan (0.05)
North America > United States > North Carolina > Durham County > Durham (0.04)

Genre: Research Report (1.00)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Reliable Measures of Spread in High Dimensional Latent Spaces

Marbut, Anna C., McKinney-Bock, Katy, Wheeler, Travis J.

arXiv.org Artificial IntelligenceJul-31-2023

Understanding geometric properties of natural language processing models' latent spaces allows the manipulation of these properties for improved performance on downstream tasks. One such property is the amount of data spread in a model's latent space, or how fully the available latent space is being used. In this work, we define data spread and demonstrate that the commonly used measures of data spread, Average Cosine Similarity and a partition function min/max ratio I(V), do not provide reliable metrics to compare the use of latent space across models. We propose and examine eight alternative measures of data spread, all but one of which improve over these current metrics when applied to seven synthetic data distributions. Of our proposed measures, we recommend one principal component-based measure and one entropy-based measure that provide reliable, relative measures of spread and can be used to compare models of different sizes and dimensionalities.

dimension, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2212.08172

Country:

North America > United States > Montana > Missoula County > Missoula (0.14)
North America > United States > Arizona > Pima County > Tucson (0.14)
North America > United States > Oregon > Multnomah County > Portland (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Catherine Lacey's Provocative Novel in Disguise

The New YorkerMay-2-2023, 00:07:39 GMT

The first thing that you notice about Catherine Lacey's new novel is the lack of a determiner. Nouns float, unhooked from any article. I found myself habitually inserting "The" in the title when the book came up in conversation, that brief sound of specificity, the most common word in the English language and the most wishful. Darkness lifts to reveal a second, nested title page, for a slightly different book: "Biography of X," by C. M. Lucca. Both title pages mention the same publisher, Farrar, Straus & Giroux.

goldman, lacey, lucca, (13 more...)

The New Yorker

Country:

North America > United States > New York (0.05)
North America > United States > Montana > Missoula County > Missoula (0.05)
North America > United States > Mississippi (0.05)
(3 more...)

Genre: Summary/Review (0.50)

Industry: Government > Regional Government > North America Government > United States Government (0.48)

Technology: Information Technology > Artificial Intelligence (0.50)

Add feedback

Deep Active Learning for Scientific Computing in the Wild

Ren, Simiao, Deng, Yang, Padilla, Willie J., Collins, Leslie, Malof, Jordan

arXiv.org Artificial IntelligenceJan-31-2023

Deep learning (DL) is revolutionizing the scientific computing community. To reduce the data gap caused by usually expensive simulations or experimentation, active learning has been identified as a promising solution for the scientific computing community. However, the deep active learning (DAL) literature is currently dominated by image classification problems and pool-based methods, which are not directly transferrable to scientific computing problems, dominated by regression problems with no pre-defined 'pool' of unlabeled data. Here for the first time, we investigate the robustness of DAL methods for scientific computing problems using ten state-of-the-art DAL methods and eight benchmark problems. We show that, to our surprise, the majority of the DAL methods are not robust even compared to random sampling when the ideal pool size is unknown. We further analyze the effectiveness and robustness of DAL methods and suggest that diversity is necessary for a robust DAL for scientific computing problems.

artificial intelligence, dal method, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2302.00098

Country:

Europe > Netherlands > South Holland > Delft (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > North Carolina > Durham County > Durham (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback